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ABSTRACT 



A method for performing background disk sector analy- 
sis for drives, including drives dedicated to redundancy 
and/or fault recovery techniques, in an intelligent, mi- 
croprocessor based disk array. The method directs the 
microprocessor to wait a specified time and test for disk 
activity. In the absence of disk activity, the disk control- 
ler is directed to generate a read request for a disk mem- 
ory location within the array. A return code following 
the array is checked to determine if the read failed, 
indicating a disk drive media failure. The disk controller 
is then notified if a failure occurs. The processor again 
checks for disk array activity and in the absence of 
activity issues a read request for successive locations 
within the array, thereby reading all disk memory loca- 
tions within the array. 

9 Claims, 14 Drawing Sheets 
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limited availability of interrupts and direct memory 

INTELLIGENT DISK ARRAY CONTROLLER access (DMA) channels. The complexity of the avail- 
able components did not require greater flexibility or 
FIELD OF THE INVENTION capabilities of the interface standard to allow the neces- 

The present invention relates to the control of disk 5 *ary operations to occur. This interface standard was 
drives within a computer system and more particularly satisfactory for a number of years, 
to a method for carrying out disk drive sector analysis As is inevitable in the computer and electronics m- 
in a background mode in an intelligent mass storage disk dustry, capabilities of the various components available 
drive array subsystem for a personal computer. increased dramatically. Memory component prices 

1° dropped in capacities and speeds increased. The perfor- 
DESCRIPTION OF THE RELATED ART mance rate and capacities of the mass storage subsys- 

Microprocessors and the personal computers which terns increased, generally by the incorporation of hard 
utilize them have become more powerful over the re- disk units for previous floppy disk units. The video 
cent years. Currently available personal computers processor technology improved so that high resolution 
have capabilities easily exceeding mainframe computers 15 color systems were reasonably affordable. These devel- 
of 20 to 30 years ago and approach capabilities of many opments all pushed the capabilities of the existing IBM 
mainframe and minicomputers currently manufactured. PC interface standard so that the numerous limitations 
Microprocessors having word sizes of 32 bits wide are in the interface standard became a problem. With the 
now widely available, whereas in the past, 8 bits was introduction by Intel Corporation of 80286, IBM devel- 
conventional and 16 bits was common. oped a new, more powerful personal computer called 

Personal computer systems have developed over the the AT. The 80286 has a 16 bit data path and 24 address 
years and new uses are being discovered daily. The uses lines ^ that it can directly address 16 Mbytes of mem- 
are varied and, as a result, have different requirements ory< j n addition, the 80286 has an increased speed of 
for various subsystems forming a complete computer operation and can easily perform many operations 
system. Because of production volume requirements 25 wfaich tmA m$ performance limits, 
and resultant economics of scale it is desirable that as It was desirab i c ^ ^ existing subsystem circuit 
many common features as possible are combined into ^ ble of ^ ^ m the new AT( ^ ^ 

high volume unit Sj This has happened in the personal mtcrface standard uscd m the PC was utilized md cx . 
computer area by developing a basic system unit which A new mterface standard was deV eloped, which 

generally contains a power supply, provisions for physi- 30 ^ becQme ^ own ^ ^ Milstry standard 
cally mounting the various mass storage devices and a ^ conncctof fof ^ Wfts 

system board, which in turn incorporates a micro- ^ J ^ additional lines for the signals used in 

processor, microprocessor related circuitry, connectors ° ' wji«-ju«iuiuw w i*u« ■»« u« e » 

for receiving circuit boards containing other subsys- the extension. These lines mcluded additional address 

terns, circuitry related to interfacing the circuit boards 35 ** d ^ * the use of the 24 bit addressing 

to the microprocessor and memory. The use of connec- *apabiht y ™* " bit data transfers, additional interrupt 

tors and interchangeable circuit boards allows subsys- direct memory access lines and lines to indicate 

terns of the desired capability for each computer system w * cth * r subsystems circuit board was capable of 

to be easily incorporated into the computer system. The the extended features. While the address values 

use of interchangeable circuit boards necessitated the 40 « presented by the 80286 microprocessor relatively 

development of an interface or bus standard so that the m ™ operation cycle, the PC mterface standard 

subsystems could be easily designed and problems could not utilize the mitial portions of the address avail - 

would not result from incompatible decisions by the abil ity because of different timing standards for the 8088 

system unit designers and the interchangeable circuit ** omid which & c pc mterface Was designed. This 

board designers. 45 hmited the speed at which operations could occur be- 

The use of interchangeable circuit boards and an the y were now limited to the interface standard 

interface standard, commonly called a bus specification memory timing specifications and could not operate at 

because the various signals are provided to all the con- ^e rate available with the 80286. Therefore, the newly 

nectors over a bus, was incorporated into the original added address signals previous available, but the newly 

International Business Machines Corporations (IBM) 50 added signals were available at an early time in the 

personal computer, the IBM PC The IBM PC Milked cycle. This change in the address single timing allowed 

in Intel Corporation 8088 as the microprocessor. The operations which utilized the extended portions of the 

8088 has an 8 bit, or 1 byte, external data interface but architecture to operate faster. With a higher perfor- 

operates on a 16 bit word internally. The 8088 has 20 mance components available it became possible to have 

address lines, which means that it can directly address a 55 a master unit other than the system microprocessor or 

maximum of 1Mbyte of memory. In addition, the mem- direct memory access controller operating the bus. 

ory components available for incorporation in the origi- However, because of the need to cooperate with circuit 

nal IBM PC were relatively slow and expensive as com- boards which operated under the new 1 6 bit standard or 

pared to current components. The various subsystems the old 8 bit standard, each master unit was required to 

such as video output units or mass storage units, were 60 understand and operate with all the possible combina- 

not complex and also had relatively low performance tions of circuit boards. This increased the complexity of 

levels because of the relative simplicity of the devices the master unit and resulted in a duplication of compo- 

available at a reasonable costs at that time. nents, because the master unit had to incorporate many 

With these various factors and component choices in of the functions and features already performed by the 

mind, an interface standard was developed and used in 65 logic and circuitry on the system board and other mas- 

the IBM PC. The standard utilized 20 address lines and ter units. Additionally, the master unit was required to 

8 data lines, individual lines to indicate input or output utilize the direct memory access controller to gain con- 

(t/O) or memory space read/write operations, and had trol of the bus, limiting prioritizing and the number of 
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master units possible in a given computer system. The Disk array subsystems may also be configured to pro- 
capabilities of components continued to increase. Mem- vide data redundancy and/or data recovery capability, 
ory speeds and sizes increased, mass storage units and Two data redundancy and recovery techniques have 
silt increased, video unit resolutions increased and Intel generally been used to restore data in the event of a 
Corporation introduced the 80386 microprocessor. The 5 catastrophic drive failure. One technique is the mir- 
increased capabilities of the components created a de- rored drive. A mirrored drive in effect creates a redun- 
sire for the use of master units, but the performance of dant data drive for each data drive. A write to a disk 
a master unit was limited by the ISA specification and array utilizing the mirrored drive fault tolerance tech- 
capabilities. The 80386 could not be fully utilized be- rique will result in a write to the primary data disk and 
cause it offered the capability directly address 4 Gbytes W a write to its mirror drive. This technique results in a 
of memory using 32 bits of address and could perform minim u m loss of performance in the disk array. How- 
32 bit wide data transfers, while the ISA standard al- cvcr « cxist certain disadvantages to the use of 
lowed only 16 bits of data and 24 bits of address. The mirrored drive fault tolerance techniques. The primary 
local area network (LAN) concept, where information disadvantage is that this technique uses 50% of total 
and file stored on one computer called server and dis- 15 data storage available for redundancy purposes. This 
tributed to local work stations having limited or no mass results m a relatively high cost of storage per available 
storage capabilities, started becoming practical with the ^yte. 

relatively low cost of high capability components Another technique is the use of a parity scheme 

needed for adequate servers and the low costs of the bl ? cks writtCD t0 various dnves 

components for work stations. An extension similar to 20 * e m ^ and ^ a known exclusive or (XOR) 

that performed in developing the ISA could be imple- ^^« c * create panty information which is written 

mented to utilize the 80386's capabilities. However, this t0 a u I 5 sen ? d . or P" 1 * *? w 10 * c «»* The advantage 

type of extension would have certain disadvantages. to tms technique » * *iay be used to minimize the 

With the advent of the LAN concept and the high „ "J" 1 of dat ! stora *r d ^«ited to data redundancy 

performance requirements of the server and of video 25 and ** ^ 8 dnve array the panty technique 

, . . \ A . , . . . , _ . would call for one drive to be used for panty lnforma- 

graphics work stations used in computer-aided design cof - ^ . \ .r^r ; ,\ *, """ 1UAO 

j • *• i* i_ j * - tion: 12.5% of total storage is dedicated to redundancy 

and animation work, the need for a very high data trans- _ ' A . <A ^ ? • j j- r L 

, . . _ Via -* • • *i * *u * as compared to 50% using the mirrored dnve fault 

fer rates became cntical. An extension similar to that * u • m. r ** j • * u • 

n j j i- *uica u . * • tolerance technique. The use of panty dnve techniques 

performed in developing the ISA would not provide 3Q decreases ^ ^ Qf ] [ However, there are a 

his capability, even lightly shorter standard cycle numbcf of disadvantagcs to ^ of ^ ' fault toler . 

times were provided because this would still leave the ^ ^ ^ ^ \ s the loss of 

performance below desired levels. performance within the disk array as the parity drive 

With the increased performance of computer sys- must ^ u ^ ated ^ time a ^ drive is ^ 

terns, it became apparent that mass storage subsystems, 35 ^ must ^ the XQR ^ m Qrder tQ wrjte tQ 

such as fixed disk dnves, played an increasingly impor- the ^ drive „ wcll ^ wdtm ^ ^ tQ ^ datfl 

tant role in the transfer of data to and from the com- oMves 

puter system. In the past few years, a new trend in mass A ^ ond ^ ^ use of th e system pro . 

storage subsystems has emerged for improving data ccs$ot t0 doTm XOR ^ ^0^^ generation, 

transfer performance, capacity and reliability. This is 40 ^ requires that the drive data be transferred from the 

generally known as a disk array subsystem. A number of drivcs> to a transfcr buffer, to the system processor local 

reference articles on the design of disk arrays have been mem0 ry to create the XOR parity information and that 

published in recent years. These include "Consider- ^ data ^ ^ back t0 the ^ ^ transfer 

ations in the Design of a RAID Prototype" by M. bufTer. As a result, the host system processor encounters 

Schulze, Report No. UCB/CSD 88/448, August, 1988, 45 a sign i ficant overhead in managing the generation of the 

Computer Science Division, University of California, X OR parity data. The use of a local processor within 

Berkeley; "Coding Techniques for Handling Failures in thc d i s k array controller also encounters many of the 

Large Disk Arrays" by G. Gibson, L. Hellerstein, R. same problems that a system processor would. The 

Karp, R. Katz and D. Patterson, Report No. drive data must again go from the drives to a transfer 

UCB/CSD 88/477, December 1988, Computer Science 50 buffer to local processor memory to generate XOR 

Division, University of California Berkeley; and "A parity information and then back to the parity drive via 

Case Study for Redundant Arrays of Inexpensive Disks the transfer buffer. 

(RAID)" by D. Patterson, G. Gibson, and R. Katz, Related to this field of data error correction is U.S. 
presented at the June 1988 ACM SIGMOD Conference p at . No. 4,775,978 for a data error correction system, 
in Chicago, HI. 55 Implementation of either mirror or parity fault toler- 
One reason for wanting to build a disk array subsys- ance techniques in a disk array significantly improves 
tem is to create a logical device that has a very high data data redundancy and recovery capability within a disk 
transfer rate. This may be accomplished by ganging array. However, conditions may arise which may lead 
multiple standard disk drives together and transferring to data loss within a disk array despite the implementa- 
data to or from these drives to the computer system 60 tion of mirror or parity fault tolerance techniques, 
memory. If n drives are ganged together, then the effec- These conditions may arise when disk sectors on a p ar- 
rive data transfer rate is increased n times. This tech- ity or mirror disk fail, followed by failure of corre- 
nique, called striping, originated in the super computing sponding disk sectors on data disks. In such instances, 
environment where the transfer of large amounts of the data cannot be regenerated as either the mirror or 
data to and from secondary storage is a frequent re- 65 parity disk has already failed and the failure not gener- 
quirement With this approach, multiple physical drives ally detected or corrected. 

may be addressed as a single logical device and may be For example, a database application may include a 

implemented either through software or hardware. number of data records which are written initially to a 
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disk array and are read multiple times without the re- troller completes the read request first Thus, though 

cords being updated . During the initial write operation, one disk may nominally be designated as the mirror 

the data written to the data drives is used to generate disk, it may act as the primary data disk in many in- 

XOR parity information. The data is written to data stances. Additionally, if a sector on either drive goes 

drive sectors on data disks within the array and the 5 oac i, the error will be provided to the operating system 

corresponding XOR parity information is written to because each controller must respond to the request, 

corresponding sectors on the parity drive. When a data cven d thc ^ & ^ from ^ otner drive, the 

drive fails, the data may be regenerated using the XOR crror is repor ted 

parity formation from the parity drive and the remain- n it will ^ apprcciated th at data loss may m 

mg da* drives. While catastrophic dnve failure > may 10 „ even when ^ or ^ fault to i era nc/tech- 

occur, the more common mode of failure for a hard disk • ue$ m u$e< j 
is for one or more disk sectors on a disk track to become 

corrupted or unreadable. Disk sector data may also be SUMMARY OF THE INVENTION 

regenerated utilizing the remaining data disks and XOR # . . , vu , 

parity disk. The Responding sectors of the remaining 15 The present mvenfcon is for use : with a ^personal com- 

data disks and the parity disk may be read to regenerate £*? tav »* a , ? ftult ^^^Obg^^my con- 

the lost disk sector data. The regenerated data may then * oUer t ? e ^oUer being capable of managing 

be written to a replacement disk as part of a total data & ; operation of an array of up to 8 standard integrated 

disk regeneration process or the regenerated data may disk drives connected in dnve pans without supervision 

be remapped to new sectors on the existing disk. 20 ? y ^ com P utcr svstcm host Specifically, the present 

In this environment, the disk controller generally will invention is directed toward a method for performing 
not access the parity disk unless a write to the corre- dj* Sttr f ace analysis by generating read requests for all 
spending data drive sectors occurs or unless the parity disks within an array, including parity and mirror disks, 
sectors must be read to generate data for one or more of •* a background task to verify disk sector integrity, 
the data drives. However, the parity disk is just as sus- 25 The method of the present invention is intended for 
ceptible to disk failure as the data drives within the use m an intelligent disk array subsystem controller 
array. During the course of operations, sectors on the having a local microprocessor operating in a multi-task- 
parity drive may become corrupted or fail. This failure ing environment. The method of the present invention 
would not normally be detected as the parity disk is not initiates a background task after a predetermined time in 
being accessed by the disk controller/Thus, the XOR 30 which no disk activity takes place, The background task 
parity information corresponding to sectors on the data creates a series of read requests which are processed by 
disks may be corrupted or lost without thc disk control- the disk controller which will cause all disk sectors 
ler or the user being aware of the parity disk sector within the array, including any disks allocated to fault 
failure. Should a corresponding data drive sector subse- tolerance and data redundancy, to be read and therefore 
quently become corrupted or otherwise fail, the disk 35 checked for sector integrity. The individual read re- 
array will be unable to regenerate the data despite the quests are scheduled by the local processor and return a 
use of parity fault tolerance techniques. success or failure code indicating the integrity of the 

A similar situation may occur where mirrored drive information in the sector read. The method includes a 

fault tolerance techniques are utilized in a disk array. As mcans f or notifying the host processor of a read failure 

in the parity fault tolerance situation, instances will arise 40 code so that corrective action may be taken. The pres- 

when a record is initially written to a data drive and is cnt invention will continue to generate read requests in 

read many times without the data record being updated. a background mode until all sectors within the disk 

The initial write request will generate a write command ^ been read and verified. Upon completion of 

to the data disk and its corresponding ; mirror disk Sub- rcadin ^ scctorSf ^ t ^ ^ 

sequent read requests wiU be directed by the disk con- 45 ^ * he read verification for the entire array again. 

• VL^V 1 ? ^present invention does not adversely affect disk 

S ~ * T • Tl° T ^ B£ .!° ntrCl " array subsystem performance as a result of this read 
ler will not access the mirror disk unless a write com- «r ■ *• IT +u u i _ i .,iu • 
mand is issued to the data disk or the mirror disk is read X^^on process as the background task will be mac- 
to regenerate data in the event of a data disk failure. 50 tlve wh f n dlsk «™y subsystem is actively perform- 
Over time sectors on the mirror disk may become |ng read or wnte operattons^ The background task wdl 
corruptedorfau.Thisfailurewillnotbedetectedbythe ^ yc f Predetermined time after disk 
disk array controller or the user as the read requests will 0P**k» **ve completed and tests to determine if the 
be directed to the data disk and not the mirror disk. ^ * 8tiU Performing read or write operations. If 
Thus, the drive controller and user are unaware of the 55 stm busv > ^ background task will again go inactive for 
rnirror disk failure. The disk array will be unable to ^ predetermined period. If the disk array is inactive, 
regenerate data should data disk sectors subsequently * e *** ™^ resume issuing read requests for the disk 
become corrupted or fail if their corresponding mirror m ^ ^ present invention will continue to issue read 
drive sectors have already become corrupted or failed. requests until all disk memory locations within the array 
This situation will not occur when the mirrored drive 60 nave been verified. Following completion of this task, 
is managed using duplexed disk controllers. In a duplex the present invention will again begin issuing read re- 
environment, separate controllers are used to manage quest for the disk array, repeating the above process, 
the data and mirrored «W.v» The initial write request DESCRIPTION OF THE DRAWINGS 
will result in a write command being generated by each 

of the controllers to their respective drives. Subsequent 65 A better understanding of the invention can be had 

read requests are also sent to both drive controllers. The when the following detailed description of the pre- 

operating system will accept the data from either the ferred embodiment is considered in conjunction with 

data or miiror disk controller based upon which con- the following drawings, in which: 
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FIGS. 1, 2A-2B arc a schematic block diagrams of a coprocessor 22 is preferably an Intel 80387 and/or Wei- 
computer system in which the present invention may be tek WTL 3167 numeric coprocessor interfacing with 
practiced; the local processor bus 26 and the processor 20 in the 

FIG. 3 is a schematic block diagram of an intelligent conventional manner. The cache RAM 28 is preferably 
disk array controller environment in which the present 5 suitable high-speed static random access memory which 

invention may be practiced; interfaces with the address and data elements of bus 96 

FIG. 4 is a flow diagram of the manner in which the under control of the cache controller 24 to carry out 
present invention resumes the generation of read re- required cache memory operations. The cache control- 
quests directed toward the disk array following a reset j er 24 is preferably an Intel 82385 cache controller con- 
condition on the computer system upon which the pres- 10 f lgurw j to operate in two-way set associative master 
ent invention is being practiced; mode FIGS t md 2j the components are the 33 

FIGS 5A-5D are flow diagrams of the manner in versions of ^ rcspectivc ^ Addrcss 

which the present invention generates read requests for 34 ^ ^ u mtcrface ^ cache 

^V\T.L n ay i controller 24 with the processor 20 and provide a local 

v?* 6A ^ d f?^^^ w ^^/^ en ^ crm » bus interface between the local processor bus 26 and a 
which the present invention initialize disk array pa™. host bus u Circmt ^ ^ lo ^ c ^ cuit which oyi6es 

SarTav * ^ a bus rcad * to control access to the local bus 26 

FIGS. 7A and 7B are portions of a task for scheduling m ^f e ^ en *f nc * ^^^J^™^ 

logical read and write requests pertinent to the present 20 *JP*?? that the next address of 

invention- and data or code to be utilized by subsystem elements in 

FIG. fi'is a flow diagram of the manner in which the W^d address mode can be placed on the local bus 

present invention activates the generation of read re- 

quests after a specified time of disk array inactivity; Noncacheable memory address map programmer 30 

FIG. 9 is a flow diagram of the manner in which the 25 ''^P"*** 8 with processor 20 and the noncacheable 

present invention determines if the disk array system is address memory 34 to map noncacheable memory loca- 

currently performing read or write operations or is tions. The noncacheable address memory 32 is utilized 

otherwise busy. to designate areas of system memory that are noncache- 
able to avoid many types of cache memory incoher- 

DETAILED DESCRIPTION OF THE encVt bus request logic circuit 42 is utilized by the 

PREFERRED EMBODIMENT processor 20 and associated elements to request access 



Table of Contents to the host bus 44 in situations such as when requested 

I. Computer System Overview data * not located m cw * e memor y 28 and access t0 

D. Disk Array Controller system memory is required. > 

III. Background Surface Analysis 35 In the drawings system C is configured haying the 

A. Restart Surface Analysis processor bus 26, the host bus 44, an extended industry 

B. Surface Analysis Task standard architecture (EISA) bus 46 (FIG. 2) and an X 

C. Initialization of Surface Analysis bus 90 ^ details thc portion of the system illustrated 

D. Scheduler Operations m FIG - 2 ^ not discussed in detail below are not 

E. Timer Function 40 significant to the present invention other than to illus- 

F. Busy Function trate & 11 example of a fully configured computer system. 

IV. Conclusion The portion of system C illustrated in FIG. 2 is essen- 

tially a configured EISA system which includes the 
I. Computer System Overview necessary EISA bus 46, and EISA bus controller 48, 

Referring now to FIGS. 1 and 2, the letter C desig- 45 data latches and transceivers 50 and address latches and 
nates generally a computer system upon which the buffers 52 to interface between the EISA bus 445 and the 
current invention may be practiced. For clarity, system host bus 44, Also illustrated in FIG. 2 is an integrated 
C is shown in two portions, with the interconnections system peripheral 54, which incorporates a number of 
between FIGS, 1 and 2 designated by reference to the the elements used in an EISA-based computer system, 
circled numbers one to eight. System C is comprised of 50 The integrated system peripheral (ISP) 54 includes a 
a number of block elements interconnected via four direct memory access controller $6 for controlling ac- 
cuses, cess to main memory 58 (FIG. 1) or memory contained 

In FIG. 1, a computer system C is depicted. A central in EISA slots and input/output (I/O) locations without 
processing unit (CPU) comprises a processor 20, a nu- the need for access to the processor 20. The main mem- 
merical coprocessor 22 and a cache memory controller 55 ory array 58 is considered to be local memory and com- 
24 and associated logic circuits connected to a local prises a memory circuit array of size suitable to accom- 
proccssor bus 26. Associated with cache controller 24 is modate the particular requirements of the system. The 
high speed cache data random access memory (RAM) ISP 54 also includes interrupt controllers 70, nonmaska- 
28, noncacheable memory address map programming ble interrupt logic 72 and system timers 74 which allow 
logic circuitry 30, noncacheable address memory 32, 60 control of interrupt signals and generate necessary tim- 
address exchange latch circuitry 34 and data exchange ing signals and wait states in a manner according to the 
transceiver 36. Associated with the CPU also are local EISA specification and conventional practice. In the 
bus ready address enable logic circuit 40 a logic circuit preferred embodiment, processor generated interrupt 
38, next address enable logic 40 and bus request logic requests are controlled via dual interrupt control cir- 
circuit 42. 65 cuits emulating and extending conventional Intel 8259 

The processor 20 is preferably an Intel 80386 micro- interrupt controllers. The ISP 54 also includes bus arbi- 
processor. The processor 20 has its control, address and tration logic 75 which, in cooperation with the bus 
data lines interfaced to the local processor bus 26. The controller 48, controls and arbitrates among the various 
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requests for the EISA bus 46 by the cache controller 24, control, address and data lines interfaced to the BMIC 

the DMA controller 56 and bus master devices located 118, CPC 120, and a transfer channel controller 124. 

on the EISA bus 46. Further, the local processor 122 is also interfaced to 

The main memory array 58 is preferably dynamic local read only memory (ROM) 126 and dynamic ran- 

random access memory. Memory 58 interfaces with the 5 dom access memory (RAM) 128 located within the disk 

host bus 44 via a data buffer circuit 60, a memory con- array controller 112. 

troller circuit 62 and a memory mapper 68. The bufTer The transfer channel controller (TCC) 124 controls 

60 performs data transceiving and parity generating and the operation of four major DMA channels that access 

checking functions, the memory controller 62 and a static RAM transfer buffer 130 which is used to store 

memory mapper 68 interface with the memory 53 via 10 data transferred by the disk system. The TCC 124 as- 

address multiplexer and column address strobe buffers signs DMA channels to the BMIC 118, the CPC 120 the 

66 and row address enable logic circuit 64. local processor 122 and to the disk array DMA channel 

The EISA bus 46 includes ISA and EISA control 114. The TCC 124 receives requests from the four chan- 

buses 76 and 78, ISA and EISA control buses 80 and 82, nels and assigns each channel a priority level. The local 

and address buses 84, 86 and 88. Referring to FIG, 2B, 15 processor 122 has the highest priority Level. The CPC 

system peripherals are interfaced via the X bus 90 in 120 channel has the second highest priority level. The 

combination with the ISA control bus 76 from the BMIC 118 channel has the third highest priority level 

EISA bus 46. Control and data/address transfer for the and the disk array DMA channel 114 has the lowest 

X bus 90 are facilitated by X bus control logic 92, data priority level. 

transceivers 94 and address latches 96. 20 The disk array DMA channel 114 is comprised of 
Attached to the X bus 90 are various peripheral de- four disk drive subchannels. The four disk drive sub- 
vices such as keyboard/mouse controller 98 which channels may be assigned to any one of eight different 
interfaces the X bus 90 with a suitable keyboard and disk drives residing in the disk array. The four drive 
mouse via connectors 100 and 102, respectively. Also subchannels have equal priority within the disk array 
attached to the X bus 90 are read only memory (ROM) 25 DMA channel The subchannels are rotated equally to 
circuits 106 which contain basic operations software for become the source for the disk array DMA channel, 
the system C and for system video operations. A serial One of the subchannels is inserted in rotation only if it 
communications port 108 is also connected to the sys- has an active DMA request. The remaining three sub- 
tern C via the X bus 90. Floppy and fixed disk support, channels are always active in the rotation, 
a parallel port, a second serial port, and video support 30 In the operation of disk array controller 112, a re- 
circuits are provided in block circuit 110 connected to quest is preferably submitted to the disk array controller 
the X bus 90. 112 through the BMIC 118. The local processor 122 on 
IT r»ict rvmtmiiAr receiving this request through the BMIC 118 builds a 
^onirouer dm structure m Joca] pTOCessoT RAM memory 128. 

The disk array controller 112 is connected to the 35 This data structure is also known as a command list and 
EISA bus 46 to provide for the communication of data may be a simple read or write request directed to the 
and address information through the EISA bus. Fixed disk array, or it may be a more elaborate request con- 
disk connectors 114 are connected to the fixed disk taining multiple read/write or diagnostic and configura- 
support system and are in turn connected to a fixed disk tion requests. The command list is then submitted to the 
array 116. A schematic block diagram of the disk array 40 local processor 122 for processing. The local processor 
controller 112 upon which the present invention may be 122 then oversees the execution of the command list, 
practiced is shown if FIG. 3. It is understood that the including the transferring of data. Once the execution of 
disk controller set forth in FIG. 3 is for the purpose of the command list is complete, the local processor 122 
illustrating the environment in which present invention notifies the operating system device driver. The submis- 
may operate, The method of the present invention may 45 sion of the command list and the notification of a com- 
be practiced on any personal computer system disk mand list completion are achieved by a protocol which 
array having a microprocessor based, intelligent array uses the BMIC 118 I/O registers. To allow multiple 
controller, and indeed can be performed by the system outstanding requests to the disk array controller 112, 
microprocessor in conjunction with more conventional these I/O registers are divided into two channels: a 
board disk control systems. 50 command list submit channel and a command list com- 

The disk array controller 112 includes a bus master plete channel, 
interface controller 118 (BMIC), preferably an Intel The method of the present invention is implemented 
Corporation 82355, which is designed for use in a 32 bit as application tasks running on the local processor 122 
EISA bus master expansion board and provides all (FIG. 3). Because of the nature of interactive input/out- 
EISA control, address, and data signals necessary for 35 put operations, it is impractical for the present invention 
transfers across the EISA bus. The BMIC 118 supports to operate as a single batch task on a local processor 
16 and 32 bit burst transfers between the disk array 122. Accordingly, the local processor 122 utilizes a real 
system and system memory. Additionally, BMIC 118 time, multitasking operating system which permits mul- 
provides for the transfers of varying data sizes between tiple tasks to be addressed by the local processor 122, 
an expansion board and EISA and ISA devices. 60 including the present invention. Preferably, the operat- 

The disk array controller 112 also includes a compati- ing system on the local processor 122 is the AMX86 

bility port controller (CPC) 120. The CPC 120 is de- Multitasking Executive by Kadak Products Limited, 

signed as a communication mechanism between the The AMX86 operating system kernel provides a num- 

EISA bus 46 and existing host driver software not de- ber of system services in addition to the applications set 

signed to take advantage of EISA capabilities. 65 forth the method of the present invention. 

Also included in the disk array controller 112 is a A greater understanding of the manner in which 

microprocessor 122, preferably an Intel Corporation commands are submitted to the disk array controller 

80186 microprocessor. The local processor 122 has its upon which the present invention may be practiced may 
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be had by reference to currently co-pending U.S. patent 
application Ser. No. 431,737 to Schmenk et al. f entitled 
BUS MASTER COMMAND PROTOCOL, filed 
Nov. 3, 1989 and assigned to Compaq Computer Corpo- 
ration, assignee of the present invention. 5 

III. Background Surface Analysis 

The present invention is a method which instructs the 
local processor 122 to generate a series of read requests 
during periods of disk inactivity which sequentially 10 
read all disk sectors within the disk array, including any 
disk which is dedicated to fault tolerance techniques 
such as mirror or parity fault tolerance techniques. The 
result of the read request is checked by the local proces- 
sor 122 to determine if the read was successful. If not 15 
successful, the present invention notifies the disk con- 
troller of the error. 

A. Restart 

FIG. 4 is a flow diagram of the RESTART task 650 20 
which is activated after the computer system has been 
reset such as after a power off condition. Operation of 
the RESTART task 650 begins at step 651 wherein the 
local processor 122 creates a one-shot timer task using 
an AMX system utility. Control transfers to step 652 25 
wherein the local processor 122 determines whether the 
GLOBAL RIS information is present on the current 
disk. The GLOBAL RIS information describes the disk 
array, including, among other parameters, the number 
of heads, cylinders, sectors, sectors per track and other 30 
information, for all disks within the array. This is 
known as the GLOBAL RIS (reserved information 
sectors) for the reserved disk sectors on the disk where 
such information is stored and is initialized upon com- 
puter system setup. If not present, control transfers to 35 
step 654 and the RESTART task 650 is stopped and 
background surface analysis is not performed. If the 
GLOBAL RIS information is present, control of pro- 
cessor 122 transfers to step 656 wherein processor 122 
calls the INITIALIZE task 600 (FIG. 6) and sets the 40 
RESET code to TRUE. In the INITIALIZE task 600 
the processor 122 determines the starting parameters, 
such as the number of drives to be checked and the 
individual disk parameters, for example, the number of 
heads, tracks and cylinders per drive, the number of 45 
sectors per track. Upon return from the INITIALIZE 
task 600, control of processor 122 transfers to step 658 
where the processor 122 checks to determine if the 
current version of the GLOBAL RIS is greater than the 
initial release version, This is done to determine if the 50 
disk array controller has been configured to include the 
present invention. If the RIS version is less than the 
initial release version which incorporates the present 
invention, a default delay parameter to be used with the 
TIMER task 680 (FIG. 8) is added to the GLOBAL 55 
RIS and the version information is updated to at least 
the initial release version. The TIMER delay may be 
selectively set by the user by causing the host processor 
20 to issue a command which updates the GLOBAL 
RIS with the new TIMER delay period. This is per- 60 
formed by means of a utility program not described in 
this specification. In the preferred embodiment, the 
TIMER delay is fifteen (15) seconds. If it is determined 
in step 658 that the current RIS version is less than the 
initial release version, control transfers to step 660 65 
where the GLOBAL RIS is updated and the TIMER 
delay value is set to the default value. Control is then 
transferred to step 662. If it is determined in step 658 
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that the current RIS version is not less than the initial 
release version, control transfers to step 662 wherein 
processor 122 determines if the number of fault tolerant 
drives within the array, as determined by the INITIAL- 
IZE task 600, is greater than zero. If not greater than 
zero, control of processor 122 transfers to step 664 
where the task is stopped. If greater than zero, control 
transfers to step 666 which initiates the one-shot timer 
which is utilized in the present invention for delay pur- 
poses prior to starting or restarting the actual surface 
analysis. Control then transfers to step 668 which ends 
execution of the task. 

B. Surface Analysis 

FIGS. 5A-5D are flow diagrams of the SUR- 
FACE-ANALYSIS task 500 within the present inven- 
tion running on the local processor 122 which generates 
sequential read requests for all disk sectors within the 
disk array. Operation begins at step 502, wherein pro- 
cessor 122 determines whether the variable NEX- 
T-ACTION to be carried out is the issue of a back- 
ground read request. NEXT-ACTION is initialized to 
issue a read request. If the NEXT-ACTION value is 
the issue of a read request, control transfers to step 504 
where the number of pending logical read or write 
requests is stored to a local variable within the control- 
ler memory 128. Control then transfers to step 506. If in 
step 502 it is determined that the next action to be car- 
ried out by the current task is not an issue read request, 
control transfers to step 506. In step 506 the processor 
122 calls the INITIALIZE task 600. When the INI- 
TIALIZE task 600 is called in step 506, a code indicat- 
ing a NON-RESET condition is sent to the task 600. 
Upon return from the INITIALIZE task 600, control 
transfers to step 508 wherein processor 122 checks for 
the number of fault tolerant drives within the array, 
which is determined by the INITIALIZE task 600 
called in step 506. If the number of drives is equal to 
zero, control transfers to step 510 wherein the processor 
122 halts operation of the current task. If the number of 
fault tolerant drives within the array is non-zero, con- 
trol transfers to step 512 which is a switch based upon 
the next action to be taken. 

If the NEXT— ACTION variable is set to issue a read 
request, control transfers to step 514 (FIG. SB). Control 
then transfers to step 516 wherein the processor 122 sets 
the drive to be checked to the current drive being ad- 
dressed by the task 500. Control transfers to step 518 
which determines whether the current drive is a bad 
drive withi n the array, based upon parameters returned 
from the INITIALIZE task 600 called in step 506. If the 
current drive is a bad drive, control transfers to step 520 
wherein the processor 122 increments the current drive 
pointer to the next drive in the array. Control then 
transfers to step 522. If it is determined in step 518 that 
the current drive is not faulty or bad, control transfers 
to step 522. In step 522, the local processor 122 allocates 
RAM memory 128 for a transfer buffer to return the 
data and condition codes resulting from the read re- 
quest. Control transfers to step 524 wherein the proces- 
sor 122 tests whether there exists sufficient RAM mem- 
ory 128 to allocate for the read request transfer buffer. 
If there is not sufficient memory 128, control transfers 
to step 526 where execution pauses until the TIMER 
task 680 reactivates or wakes up the SURFACE—A- 
NALYSIS task 500. After the TIMER task 680 restarts 
the SURFACE_JVNALYSIS task 500, control trans- 
fers to step 528. The lack of sufficient memory 128 will 
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generally occur when the disk controller 112 is attempt- . request is not dequeued, the local memory 128 could 

ing to perform a number of I/O transfers. Thus, the lack not be deallocated and would rapidly be filled with 

of memory 128 will occur when the array controller 112 completed read requests, as well a new requests and 

is busy. If it is determined in step 524 that there is suffi- data to be transferred. Control transfers to step 542 

cient memory 128 for the transfer buffer associated with 5 wherein the processor 122 deallocates the memory 128 

the read request, control transfers to step 528 wherein which was allocated for return information from the 

the processor 122 generates a logical read request for drive request. Control transfers to step 544 wherein 

the current drive, head and cylinder. In a hard disk, a processor 122 sets the NEXT—ACTION variable to 

track references a radial position on a single rotating issue a new read request. 

disk media. The track itself is composed of multiple disk 10 In order to read all disk memory locations, it is neces- 

sectors which describe a radial portion of the track. The sary that the present invention increment to the next 

sectors per track information is stored within the applicable head, cylinder and drive as required to se- 

GLOBAL RIS information on each disk within the disk quentially read all disk sectors. Control transfers to step 

array. Most hard disk media are composed of multiple 546 wherein processor 122 determines whether incre- 

coaxial disks. Thus, the same track position on each of 15 meeting to the next head will exceed the maximum 

the disks is referenced as a cylinder. When a drive re- number of heads for the drive currently being checked, 

quest is issued for a specific head, cylinder and disk, the If it is determined that the maximum number of heads 

disk and cylinder information are explicitly specified for the present drive will not be exceeded, control trans- 

and the track information is specified by the head asso- fers to step 548, which increments to the next head for 

ciated with the particular disk. In the present invention, 20 the current drive. Control then transfers to step 564 

a read request is intended to read all sectors on a panic- (FIG. 5D). 

ular track. If it is determined in step 546 that the current head is 
Control transfers to step 530 in which the SUR- the last valid head designated for the present drive, 
FACE— ANALYSIS task 500 queues the current read control transfers to step 550, which sets the current 
request to be acted upon by the controller 112. In the 25 head pointer to the first head for a drive. Control trans- 
preferred embodiment, a better understanding of the fers to step 552 wherein the processor 122 determines if 
method of generation of the read request and the sched- the current cylinder is at the last cylinder for the drive, 
uling of the request may be had with reference to previ- If not on the last cylinder, control transfers to step 554 
ously referenced application Ser. No. 431,737. Control which increments the current cylinder to the next cylin- 
of the processor 122 is transferred to step 532 wherein 30 der on the current drive. Control then transfers to step 
the NEXT-ACTION variable is set to check the re- 564 (FIG. 5D). If it is determined in step 552 that the 
suits of the read request and execution is paused until current cylinder is the last valid cylinder for the current 
the read request is completed. After the read request is drive, control transfers to step 556 which sets the cur- 
completed execution recommences and control then rent cylinder equal to the first cylinder for a drive, 
transfers to step 512. 35 Control then transfers to step 558 which determines 
If in step 512, it is determined that the next action to whether the current drive is the last valid drive in the 
be taken by the task 500 is to check the results of the array. If the last valid drive in the array, control trans- 
read request, control transfers to step 534 (FIG. 5C). fers to step 560 which sets the current drive equal to the 
Control transfer to step 536, wherein the processor 122 first drive in the array. If it is determined in step 558 that 
reads the transfer buffer to determine if an error code 40 the current drive is not the last drive in the array, con- 
indicating disk sector failure has been returned by the trol transfers to step 562 which increments to the next 
read request. If a read request has ah error code re- drive in the array. 

turned, control transfers to step 538. An error code will Thus, the SURFACE—ANALYSIS task 500 will 
be returned if any one of the sectors within the track progressively issue read requests for all heads for a 
specified in the read request is faulty. In step 538 the 45 cylinder for a drive, all cylinders for a drive and then 
task 500 notifies the controller of the disk failure. The for all drives in an array. Control transfers to step 564 
failure notification may be acted upon locally by the which calls the BUSY task 700 (FIG. 9) to determine if 
disk controller 112 or may be passed on to the host there are pending disk I/O tasks. Control transfers to 
system via BMIC 118 to permit the operator or operat- step 566 which determines whether the BUSY task 700 
ing system to take any corrective action necessary. One 50 returns a TRUE code. If a TRUE code is returned, 
means of corrective action which may be taken locally control transfers to step 570 where execution pauses 
is the automatic remapping of a faulty sector to a new until reactivated by the TIMER task 680, which waits a 
sector on the same disk. This method of automatically predetermined time after all drive requests are corn- 
remapping a faulty disk sector is described in co-pend- pleted before reactivating the SURFACE—ANAL- 
ing U.S. application Ser. No. 556,646 to Ewert et a]., for 55 YSIS task 500. After reactivation control returns to step 
AUTOMATIC HARD DISK BAD SECTOR 512. If it is determined in step 566 that the BUSY code 
REMAPPING, filed Jul. 20, 1990 and assigned to Com- is FALSE, control transfers to step 568 wherein the 
paq Computer Corporation. However, it is understood processor determines whether the local variable in 
that other methods of corrective action, such as manual which the number of pending I/O requests is equal to 
intervention and regeneration of the data contained 60 the number of currently pending I/O requests. If equal, 
within the faulty disk sector may be taken. Control of no new pending I/O tasks have been received and con- 
the processor 122 transfers to step 540. If its is deter- trol transfers to step 512. If not equal, the SUR- 
mined in step 536 that the completed read request did FACE_ANALYSIS task 500 will conclude that at 
not return an error code, control transfers to step 540 in least one new task has been received by the disk array 
which the processor 122 removes this current read 65 controller 112 which has not been acted upon and con- 
request from the disk controller 112 queue. This is nee- trol transfers to step 570 wherein the SURFACE-A- 
essary to permit the memory .128 which has been alio- NALYSIS task 500 goes inactive until reactivated by 
cated to the read request to be deallocated. If the read the TIMER task 680. 
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Thus, the SURFACE—ANALYSIS task 500 will the array, control transfers to step 628 which returns to 

continue to loop and sequentially check all sectors the calling task, 
within the disk array in a background mode. If the disk 

array is actively processing I/O requests, the SUR- D - Schedule Operations 
FACE—ANALYSIS task 500 will go into an inactive 5 FIGS. 7A and 7B are portions of a procedure used to 

state for a predetermined period of time. process logical requests sent to the disk array. These 

C Initialize portions are used to manage the operation of the 

moc * a « rx\ « ,. , , , VTT ^ T A „ TIMER task 680 in the present invention. In the pre- 
S -«Aand«B^flowduigr a n 1 softf.eINITIAL- ferxed embodiment , ^ procedures we „ 
IZE task 600 which » used to set the mitud parameters .0 ^ of ^ ^ & ^ ^ ^fon 
for the generation of the read requests by the SUR- , , « . aiia 7 nmiuu fi uu 
FACE-ANALYSIS task 500. C^ration of the ^ fT'VS'' * T*™^*" 
TIALIZE task 600 begins at step602 wherein the pro- ^ * " md ^ D ? cnt fimc ' 
cesser 122 determines whether information which de- ^ ^r the nmagement of tmimg events m the present 
scribes the disk array, including parameters such as 15 ^ctA' *a * * AAA . , , 
heads, cylinders, sectors, sectors per track and other ?° 7A >, at stc P the ^ Processor 122 re- 
information, for all disks within the array is present. °* iy/es a re 9 uest Control transfers to step 802 
This is known as the GLOBAL RIS (reserved mforma- wh « ein the local processor 122 determines whether the 
tion sectors) for the reserved disk sectors on the disk currcnt lo &d r «l ucst is equal to zero and 
where such information is stored and is initialized upon 20 wheA er the TIMER as initialized in the RESTART 
computer system setup. If not present, control transfers 650 or elsewhere is running. If both conditions are 
to step 604 and operation of the task ceases. If the control transfers to step 802, wherein the local 
GLOBAL RIS is present, control transfers to step 606 processor 122 stops and clears the one-shot timer task 
where the processor 122 determines whether a RESET operating under the control of the AMX operating 
code, which is set by RESTART task 650, is present. If 25 system. Control is then transferred to step $06 which 
the RESET code is present, control transfers to step increments the logical request count. Thus, when the 
608, which sets all parameters to the first head, first local processor 122 receives a logical request, the one- 
cylinder and first disk within the array. Control then shot timer task is halted and the present invention goes 
transfers to step 610. If it is determined in step 606 that inactive. Control then proceeds to other processing 
no RESET code is set, such as when the INITIALIZE 30 steps necessary when a request has been received, 
task 600 is called by SURFACE-ANALYSIS task 500, In FIG. 7B, the section of code represents the corn- 
control transfers to step 610 where the processor 122 pletion of a logical request. In step 810, the local proces- 
initializes a disk counter. Control transfers to step 612 sor 122 decrements the logical request count upon com- 
which sets the current logical volume to the volume of pletion of the logical request. Control transfers to step 
which the current disk is a member. Control transfers to 35 812 wherein the local processor 122 determines 
step 614. whether the logical request count is equal to zero and 

In step 614, the local processor 122 determines the current logical unit is set for either mirror or parity 

whether the current unit is a valid logical unit for the fauh tolerance mode and whether the one-shot timer 

array and whether the current unit is part of a logical ^ has stopped. If all of the above conditions are 
7 w ? ^ f i techni£ * ue - } x 40 true, control transfers to step 814 wherein the local 

should be noted Oat a logical unit identifies no only the processor 122 resets the one-shot timer task to the time 

fault tolerant disk associated with the unit, but thedata £ dicated fa ^ GLOBAL RIS and starts the timer 

disks as well. If true, control transfers to step 616 (FIG. ninn ; nn u;. _ . . # , . . ~7 

6B) wherein processor 122 initializes all variables asso- S5S«' t *Vfr completes the 

ciated with .disk request, such as number of rcTds, 45 TIMER i^k 680 u oUled. If «ic^^uest is 

cylinders, disks, etc. to the disk parameters stored 1 »™£ T J^^**^^ 

within the GLOBAL RIS. Control transfers to step 618 ™P 2 W f f nte . r a T which *»f » * 

wherein the processor 122 tests whether the current ^L 01 **? 11 * one - shot tuner as set forth in 

cylinder is set to cylinder zero, which would indicate a *f G ' 7A - ^ the P rescnt invention will reset and start 

drive which has never been used, or whether the 50 ^one-shot timer upon completion of all pending oper- 

RESET code is TRUE. If either condition is true, con- ^± °»***">t timer task completes, the 

trot transfers to step 620 which sets the current cylinder T1MER task 680 ltsclf called- 

equal to cylinder 1. Control thereafter transfers to step £. Timer 
622. If both conditions are false in step 618, control of 

processor 122 transfers to step 622 wherein the valid 55 F1G ; 8 » a . flow diagram of the TIMER task 680. 

fault tolerant drive count is incremented. Control then Operation begins at step 684 following the completion 

transfers to step 626. If in step 614, it is determined that of the one-shot timer task wherein the local processor 

the current unit is not a valid logical unit or that the 122 ca ^ s BUSY task 700 to determine if the disk 

current unit is not set for fault tolerant operation, con- controller 112 is performing I/O operations. Upon re- 

trol transfers to step 624 (FIG. 6B), in which the drive 60 turn from the BUSY task 700, control transfers to step 

parameters are set to indicate that the current drive is wherein processor 122 determines whether the disk 

also faulty as a result of the current logical unit being controller 112 is, in fact, busy. If the BUSY code is 

faulty. Control transfers to step 626. In step 626, the TRUE, control transfers to step 692, where the TIMER 

local processor 122 determines whether there are addi- task 680 is halted. If the BUSY code is set to FALSE, 

tional drives within the array. If there are additional 65 control transfers to step 688 wherein processor 122 

drives, control of the processor 122 transfers to step 630 reactivates operation of the SURFACE-ANALYSIS 

which increments to the next drive and control then is task 500. Control transfers to step 690 where the 

transferred to step 614. If there are no further drives in TIMER task 680 is halted. 
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P g (d) checking the results of the read operation to deter- 

y mine if the read operation has failed and upon de- 

FIG. 9 is a flow diagram of the BUSY task 700, which tection of a failure, indicating the disk media failure 

is used by the present invention to determine if the disk for the current disk memory location; 

array is carrying out a particular type of I/O operations 5 (c) incrementing to a successive disk memory loca- 

referred to as regeneration. Operation begins at step 702 tion; and 

wherein the local processor 122 determines if the (f) continuously performing steps (b) through (e) for 

GLOBAL RIS parameters are set to null which would all disk memory locations for all disks located 

indicate a bad or missing disk. If yes, control transfers to within the disk array. 

step 706. If it is determined in step 702 that the 10 2. The method of claim 1, wherein the step of initial- 
GLOBAL RIS parameters are present, control trans- j^S disk memory location further includes determin- 
fers to step 704 where the local processor 122 tests to ing if the computer system has been reset and if having 
determine if the current volume is currently undergoing ^ ccn rcsct » initializing the current disk memory location 
data regeneration. The preferred embodiment will indi- a predetermined disk memory location within the 
cate in the GLOBAL RIS structure when a drive has array- 
been replaced and/or data for a drive is being regener- . 3 *. ^ «ethod of claim 1 wherein the step of deter- 
ated. In such instance, it is inadvisable to direct a read mining if the disk array is active further includes deter- 
request to a drive that is being regenerated. If the drive niining if the disk array controller has received but not 
is being regenerated, control transfers to step 706. In ^^"P 011 « input or output operation request, 
step 706, the BUSY code is set to TRUE and control 7' 1116 mcthod of claim 3, wherein the step of deter- 
transfers to step 710. If it is determined in step 704 that 20 numng whethcr disk a™y * active further includes 
the current volume is not being regenerated, control determining whether the disk on which the current disk 
transfers to step 708 which sets the BUSY code to mem ory location is located is currently being regener- 
FALSE. Control then transfers to step 710 wherein the ated ' „ . _ , . • 
processor 122 determines whether there are additional t 5, F ° r . use with a computer system having a fault 
logical volumes within the disk array. If there are addi- 25 tolerant, ^intelligent mass storage disk array subsystem, a 
tional volumes within the disk array, control transfers me [ 0T P"* 0 ™** disk sector analysis for all drives 
to step 712 which increments to the next logical vol- ™tlun the dl ? k . array, the steps comprising: 
ume. Control is then transferred back to step 702 which (a) determining disk and disk array parameters and 
will cause the BUSY task 700 to loop until all logical n^^l 8 . totUS mfomat J lon i 
volumes have been checked. If there are no additional 30 <M mitialmng a current disk memory location based 
logical volumes within the array, control transfers to u P° n th f dlsk ft ™y and dlsk dnve status mforma- 
step 714 which returns control of the local processor , t»on and current disk parameters; 
122 to the calling task. dctenmm ng whether the disk array is in the pro- 
cess of carrying out disk operations, and if carrying 
IV. Conclusion „ out such operations, suspending operation of the 
It will be appreciated that the method of issuing and of the present mvention for a predetcr- 
performing background read requests directed to all Tt ^ ° *? ! « ^ ste P s ^ 
drives within a disk drive array, Lluding drives dedi- .J* °t*? tl °™ ?*™ d ^ u 
cated to parity and mirror fault tolerant techniques will ^f^^l^/ ^ ^ h™™ 1 
increase the reliability of the disk array. Failures on a 40 &t ^Ti>^w ^ ° F 

F™? 5 rege f n T° n f ^ mine if the read request has failed and upon detec- 

y iecton - w - tion of a failure, irfdicating the disk media failure 

mapped to valid sectors on the disk. for the current disk mcmo * locati 

ven^ar^^ « » * ~ ^ *- 

various changes m the size, shape, materials, compo- w continuously performing steps (c) through (0 for 

nen s, circuitry wmng connecuons and contacts, as ^ disk mcmo £ locati * ns f or K ^ ^ lo \4 ted 

well as m the details of the illustrated circuitry, con- ^thin the disk array 

struction and method of operation may be made with- 6 . mcthod tf'dmttts. wherein the step of initial- 

outdepartmg from the spint of the mvention. ^ ^ memory location includes determin- 

wnat is claimed is: ing if the computer system has been reset and if having 

1. For use with a computer system having a fault been resctt initializing the current disk memory location 

tolerant, mtelhgent mass storage disk array subsystem to a predetermined disk memory location within the 

having individual disk and disk array parameters and array. 

associated status information for the disk and disk array, 55 7. The method of claim 5 wherein the step of deter- 

a method for performing disk sector analysis for all mining if the disk array is active further includes deter- 

dnves within the disk array, the steps comprising: mining if the disk array controller has received but not 

(a) initializing a current disk memory location based acted upon a input or output operation request, 
upon the disk array and disk drive status infonna- 8. The method of claim 7, wherein the step of deter- 
tion and current disk parameters; « inining whether the disk array is active further includes 

(b) determining whether the disk array is in the pro- determining whether the disk on which the current disk 
cess of carrying out disk operations, and if carrying memory location is located is currently being regener- 
out such operations, suspending operation of the ated. 

method of the present invention for a predeterr 9. The method of claim 5, wherein the step of deter- 
mined period of time and repeating this step (b) 65 niining disk and disk array parameters further includes 
until disk operations are not being carried out; reading disk and disk array parameter information from 

(c) performing a read operation on the current disk a reserved disk memory location of a disk, 
memory location; * • * • * 
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